منابع مشابه
Filtering erroneous protein annotation
MOTIVATION Automatically generated annotation on protein data of UniProt (Universal Protein Resource) is planned to be publicly available on the UniProt web pages in April 2004. It is expected that the data content of over 500,000 protein entries in the TrEMBL section will be enhanced by the output of an automated annotation pipeline. However, a part of the automatically added data will be erro...
متن کاملHighlight report: Erroneous sample annotation in a high fraction of publicly available genome-wide expression datasets.
متن کامل
FixPred: a resource for correction of erroneous protein sequences
Protein databases are heavily contaminated with erroneous (mispredicted, abnormal and incomplete) sequences and these erroneous data significantly distort the conclusions drawn from genome-scale protein sequence analyses. In our earlier work we described the MisPred resource that serves to identify erroneous sequences; here we present the FixPred computational pipeline that automatically correc...
متن کاملDictionary-driven protein annotation.
Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has ...
متن کاملChallenges for protein family annotation
In the wake of the many fruitful genome projects, tools to aid the annotation of proteomic data are sorely needed. Building from relatively simplistic approaches to an integrated system developed in collaboration with text mining experts, we have created tools capable of producing core annotation for protein families; but many challenges remain. Extending and improving these tools should have w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2004
ISSN: 1367-4803,1460-2059
DOI: 10.1093/bioinformatics/bth938